The viewpoint complexity of an object-recognition task
نویسندگان
چکیده
There is an ongoing debate about the nature of perceptual representation in human object recognition. Resolution of this debate has been hampered by the lack of a metric for assessing the representational requirements of a recognition task. To recognize a member of a given set of 3-D objects, how much detail must the objects' representations contain in order to achieve a specific accuracy criterion? From the performance of an ideal observer, we derived a quantity called the view complexity (VX) to measure the required granularity of representation. VX is an intrinsic property of the object-recognition task, taking into account both the object ensemble and the type of decision required of an observer. It does not depend on the visual representation or processing used by the observer. VX can be interpreted as the number of randomly selected 2-D images needed to represent the decision boundaries in the image space of a 3-D object-recognition task. A low VX means the task is inherently more viewpoint invariant and a high VX means it is inherently more viewpoint dependent. By measuring the VX of recognition tasks with different object sets, we show that the current confusion about the nature of human perceptual representation is partly due to a failure in distinguishing between human visual processing and the properties of a task and its stimuli. We find general correspondence between the VX of a recognition task and the published human data on viewpoint dependence. Exceptions in this relationship motivated us to propose the view-rate hypothesis: human visual performance is limited by the equivalent number of 2-D image views that can be processed per unit time.
منابع مشابه
Distinct visual perspective-taking strategies involve the left and right medial temporal lobe structures differently.
This study assesses the role of the human medial temporal lobe (MTL) structures in the coordination of spatial information across perspective change and, in particular, in visual perspective taking--namely the capacity to know what another individual is seeing on the visual scene. Fourteen patients with unilateral temporal lobe resection and 21 control subjects performed two tasks, called 'obje...
متن کاملParallel Spatial Pyramid Match Kernel Algorithm for Object Recognition using a Cluster of Computers
This paper parallelizes the spatial pyramid match kernel (SPK) implementation. SPK is one of the most usable kernel methods, along with support vector machine classifier, with high accuracy in object recognition. MATLAB parallel computing toolbox has been used to parallelize SPK. In this implementation, MATLAB Message Passing Interface (MPI) functions and features included in the toolbox help u...
متن کاملAM281, Cannabinoid Antagonist/Inverse agonist, Ameliorates Scopolamine-Induced Cognitive Deficit
Objective(s) Cannabinoids have been implicated in memory deficit. We examined the effect of AM281, cannabinoid antagonist/inverse agonist in prevention of scopolamine-induced cognitive deficit. Materials and Methods Object recognition task was used to evaluate memory in mice. Exploration time in the first and the second trial was recorded. The differences in exploration between a previously...
متن کاملLow-level correlations between object properties and viewpoint can cause viewpoint-dependent object recognition.
Viewpoint-dependent recognition performance of 3-D objects has often been taken as an indication of a viewpoint-dependent object representation. This viewpoint dependence is most often found using metrically manipulated objects. We aim to investigate whether instead these results can be explained by viewpoint and object property (e.g. curvature) information not being processed independently at ...
متن کاملAn Investigation into the Effects of Joint Planning on Complexity, Accuracy, and Fluency across Task Complexity
The current study aimed to examine the effects of strategic planning, online planning, strategic planning and online planning combined (joint planning), and no planning on the complexity, accuracy, and fluency of oral productions in two simple and complex narrative tasks. Eighty advanced EFL learners performed one simple narrative task and a complex narrative task with 20 minutes in between. Th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Vision Research
دوره 38 شماره
صفحات -
تاریخ انتشار 1998